Picture for Zhengzhong Liu

Zhengzhong Liu

PR2: Predictive Routing Replay for MoE-Based LLM Reinforcement Learning

Add code
May 29, 2026
Viaarxiv icon

Efficient Agentic Reasoning Through Self-Regulated Simulative Planning

Add code
May 21, 2026
Viaarxiv icon

EMO: Frustratingly Easy Progressive Training of Extendable MoE

Add code
May 14, 2026
Viaarxiv icon

CocoaBench: Evaluating Unified Digital Agents in the Wild

Add code
Apr 14, 2026
Viaarxiv icon

World Reasoning Arena

Add code
Mar 26, 2026
Viaarxiv icon

IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL

Add code
Mar 12, 2026
Viaarxiv icon

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Add code
Nov 15, 2025
Figure 1 for PAN: A World Model for General, Interactable, and Long-Horizon World Simulation
Figure 2 for PAN: A World Model for General, Interactable, and Long-Horizon World Simulation
Figure 3 for PAN: A World Model for General, Interactable, and Long-Horizon World Simulation
Figure 4 for PAN: A World Model for General, Interactable, and Long-Horizon World Simulation
Viaarxiv icon

K2-Think: A Parameter-Efficient Reasoning System

Add code
Sep 09, 2025
Viaarxiv icon

Vision-G1: Towards General Vision Language Reasoning with Multi-Domain Data Curation

Add code
Aug 18, 2025
Figure 1 for Vision-G1: Towards General Vision Language Reasoning with Multi-Domain Data Curation
Figure 2 for Vision-G1: Towards General Vision Language Reasoning with Multi-Domain Data Curation
Figure 3 for Vision-G1: Towards General Vision Language Reasoning with Multi-Domain Data Curation
Figure 4 for Vision-G1: Towards General Vision Language Reasoning with Multi-Domain Data Curation
Viaarxiv icon

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Add code
Jun 17, 2025
Figure 1 for Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Figure 2 for Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Figure 3 for Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Figure 4 for Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Viaarxiv icon